Picture for Maosong Sun

Maosong Sun

Tsinghua University

Crafter: A Multi-Agent Harness for Editable Scientific Figure Generation from Diverse Inputs

Add code
May 28, 2026
Viaarxiv icon

Does Seeing More Mean Knowing More? Mono-Anchored Advantage Normalization for Multi-Source Visual Reasoning

Add code
May 25, 2026
Viaarxiv icon

Test-Time Deep Thinking to Explore Implicit Rules

Add code
May 24, 2026
Viaarxiv icon

DiffScore: Text Evaluation Beyond Autoregressive Likelihood

Add code
May 12, 2026
Viaarxiv icon

Khala: Scaling Acoustic Token Language Models Toward High-Fidelity Music Generation

Add code
May 03, 2026
Viaarxiv icon

NaviRAG: Towards Active Knowledge Navigation for Retrieval-Augmented Generation

Add code
Apr 14, 2026
Viaarxiv icon

VeriAgent: A Tool-Integrated Multi-Agent System with Evolving Memory for PPA-Aware RTL Code Generation

Add code
Mar 18, 2026
Viaarxiv icon

Cheers: Decoupling Patch Details from Semantic Representations Enables Unified Multimodal Comprehension and Generation

Add code
Mar 13, 2026
Viaarxiv icon

MERLIN: Building Low-SNR Robust Multimodal LLMs for Electromagnetic Signals

Add code
Mar 09, 2026
Viaarxiv icon

Imagination Helps Visual Reasoning, But Not Yet in Latent Space

Add code
Feb 26, 2026
Viaarxiv icon